Skip to content

Copybara import of the project:#8143

Closed
copybara-service[bot] wants to merge 1 commit intomasterfrom
test_738898954
Closed

Copybara import of the project:#8143
copybara-service[bot] wants to merge 1 commit intomasterfrom
test_738898954

Conversation

@copybara-service
Copy link
Copy Markdown
Contributor

@copybara-service copybara-service bot commented Mar 27, 2025

Copybara import of the project:

--
7a85278 by EJ Park eunjpark@quicinc.com:

scripts to generate x32 transposec for HVX

--
1c50176 by EJ Park eunjpark@quicinc.com:

[WIP] tranpose for x32, vector_width=32

--
faa8310 by EJ Park eunjpark@quicinc.com:

Temp changes to print out

--
057264d by EJ Park eunjpark@quicinc.com:

Some generated kernels are removed.

--
8238faf by EJ Park eunjpark@quicinc.com:

scripts and hvx.c.in changed temporarily

--
a255441 by EJ Park eunjpark@quicinc.com:

Trying scalar implementation for tail part

--
f078770 by EJ Park eunjpark@quicinc.com:

Almost working version

Some tail case is not working properly.

e.g.,
34x34 matrix:
We do HVX transpose 32x32, then 32x2, 2x32 and 2x3 with scalar code.
Output has slight mismatch for the last row.

--
0b73071 by EJ Park eunjpark@quicinc.com:

Fixed the tail code

Now applying input_offset and output_offset works both for HVX code and scalar tail code.

--
51c3e8b by EJ Park eunjpark@quicinc.com:

Remove print statements in test

--
f0e0eb7 by EJ Park eunjpark@quicinc.com:

Revert unnecessary changes for this commit

--
95c6828 by EJ Park eunjpark@quicinc.com:

Change to output, input pointer

The code to reset output and input pointer will only be executed if we go to the next iteration. Otherwise, skip.

--
4f7a434 by EJ Park eunjpark@quicinc.com:

Change the way to use include

Rather than using <>, change to use "" to include xnnpack header files.

--
4d46eb6 by EJ Park eunjpark@quicinc.com:

Add prod_microkernels in Cmake build

Add kernels used in configs into cmake build for hexagon

FUTURE_COPYBARA_INTEGRATE_REVIEW=#7395 from ejparkqc:x32-transposec 4d46eb6

@copybara-service copybara-service bot force-pushed the test_738898954 branch 9 times, most recently from f22be42 to 2272333 Compare April 1, 2025 23:52
--
7a85278 by EJ Park <eunjpark@quicinc.com>:

scripts to generate x32 transposec for HVX

--
1c50176 by EJ Park <eunjpark@quicinc.com>:

[WIP] tranpose for x32, vector_width=32

--
faa8310 by EJ Park <eunjpark@quicinc.com>:

Temp changes to print out

--
057264d by EJ Park <eunjpark@quicinc.com>:

Some generated kernels are removed.

--
8238faf by EJ Park <eunjpark@quicinc.com>:

scripts and hvx.c.in changed temporarily

--
a255441 by EJ Park <eunjpark@quicinc.com>:

Trying scalar implementation for tail part

--
f078770 by EJ Park <eunjpark@quicinc.com>:

Almost working version

Some tail case is not working properly.

e.g.,
34x34 matrix:
We do HVX transpose 32x32, then 32x2, 2x32 and 2x3 with scalar code.
Output has slight mismatch for the last row.

--
0b73071 by EJ Park <eunjpark@quicinc.com>:

Fixed the tail code

Now applying input_offset and output_offset works both for HVX code and scalar tail code.

--
51c3e8b by EJ Park <eunjpark@quicinc.com>:

Remove print statements in test

--
f0e0eb7 by EJ Park <eunjpark@quicinc.com>:

Revert unnecessary changes for this commit

--
95c6828 by EJ Park <eunjpark@quicinc.com>:

Change to output, input pointer

The code to reset output and input pointer will only be executed if we go to the next iteration. Otherwise, skip.

--
4f7a434 by EJ Park <eunjpark@quicinc.com>:

Change the way to use include

Rather than using <>, change to use "" to include xnnpack header files.

--
4d46eb6 by EJ Park <eunjpark@quicinc.com>:

Add `prod_microkernels` in Cmake build

Add kernels used in configs into cmake build for hexagon

FUTURE_COPYBARA_INTEGRATE_REVIEW=#7395 from ejparkqc:x32-transposec 4d46eb6
PiperOrigin-RevId: 738898954
@copybara-service copybara-service bot closed this Apr 9, 2025
@copybara-service copybara-service bot deleted the test_738898954 branch April 9, 2025 04:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant